System and method for dynamic metadata persistence and correlation on API transactions

ABSTRACT

A dynamic metadata persistence and correlation system and method are disclosed. The system and method provide a means of tracking and relating transactional metadata from application API calls to internal data models. This system pairs application level flexibility with dynamic correlation management for entity evolution, data retrieval, and analytics.

PRIORITY CLAIMS/RELATED APPLICATIONS

This application claims priority under 35 USC 120 to and claims the benefit under 35 USC 119(e) to U.S. Provisional Patent Application Ser. No. 62/242,253, filed Oct. 15, 2015 and entitled “System And Method For Dynamic Metadata Persistence And Correlation On API Transactions”, the entirety of which is incorporated herein by reference.

FIELD

The disclosure relates generally to tracking and relating transactional metadata from application API calls to internal data models.

BACKGROUND

Application level API calls may require a level of user flexibility in order to fully encapsulate the data of each request. Additionally, each API call contains transaction data which details the specifics of the data exchange. Both the application level API calls and transaction metadata dynamically create data keys for transfer and persistence into a database. It is desirable to provide to provide a mechanism to facilitate the transfer, persistence and correlation of the application data and the API transactional metadata.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates an example of a healthcare system that may incorporate a data persistence system;

FIG. 2 illustrates an example of an implementation of a data persistence system;

FIG. 3 illustrates an example of a health graph database that may be used by the data persistence system;

FIG. 4 details an example of a method for data persistence using the system having the three portions of the architecture shown in FIGS. 1-2;

FIG. 5 illustrates a level to which application level users can provide custom fields for persistence in the graph database;

FIG. 6 illustrates the historical correlation linking process that is part of the data persistence method shown in FIG. 4; and

FIGS. 7A and 7B illustrate an example of the dynamic provenance tracking that occurs during the correlation process.

DETAILED DESCRIPTION OF ONE OR MORE EMBODIMENTS

The disclosure is particularly applicable to a data persistence system and method that is used with the PokitDok healthcare system and method and its database shown in FIG. 1 and it is in this context that the disclosure will be described. It will be appreciated, however, that the system and method has greater utility since it may be used as a standalone system and it may be used with or integrated into other data systems in which it is desirable to facilitate the transfer, persistence and correlation of the application data and the API transactional metadata.

Application level API calls through the PokitDok platform require a level of user flexibility in order to fully encapsulate the data of each request. Additionally, each API call contains transaction data which details the specifics of the data exchange. Both the application level API calls and transaction metadata dynamically create data keys for transfer and persistence into a database. The system and method herein details the transfer, persistence and correlation of a dynamic set of application specific data in addition to API transaction metadata.

The dynamic metadata persistence and correlation system described below provides a means of tracking and relating transactional metadata from application API calls to internal data models. This system pairs application level flexibility with dynamic correlation management for entity evolution, data retrieval, and analytics.

FIG. 1 illustrates an example of a healthcare system 100 that may incorporate a data persistence system that may be implemented in a backend component 108 or a health graph store 114 or in a combination of the backend component 108 and the heath graph store 114. The healthcare marketplace system 100 may have one or more computing devices 102 that connect over a communication path 106 to a backend system 108. Each computing device 102, such as computing devices 102 a, 102 b, . . . , 102 n as shown in FIG. 1, may be a processor based device with memory, persistent storage, wired or wireless communication circuits and a display that allows each computing device to connect to and couple over the communication path 106 to a backend system 108. For example, each computing device may be a smartphone device, such as an Apple Computer product, Android OS based product, etc., a tablet computer, a personal computer, a terminal device, a laptop computer and the like. In one embodiment shown in FIG. 12, each computing device 102 may store an application 104 in memory and then execute that application using the processor of the computing device to interface with the backend system. For example, the application may be a typical browser application or may be a mobile application. The communication path 106 may be a wired or wireless communication path that uses a secure protocol or an unsecure protocol. For example, the communication path 106 may be the Internet, Ethernet, a wireless data network, a cellular digital data network, a WiFi network and the like.

The backend system 108 may also have an API processor component 110 and a health marketplace engine 112 that may be coupled together. Each of these components of the backend system may be implemented using one or more computing resources, such as one or more server computers, one or more cloud computing resources and the like. In one embodiment, the API processor 110 and health marketplace engine 112 may each be implemented in software in which each has a plurality of lines of computer code that are executed by a processor of the one or more computing resources of the backend system. In other embodiments, each of the API processor 110 and health marketplace engine 112 may be implemented in hardware such as a programmed logic device, a programmed processor or microcontroller and the like. The backend system 108 may be coupled to a health graph store 114 that stores the various data and software modules that make up the healthcare system as well as the health graph database that is used to store the health data of the backend component 108. The store 114 may be implemented as a hardware database system, a software database system or any other storage system. In this example implementation, the data persistence system may be incorporated into the backend system 108 or may be coupled to the backend system 108, but located remotely.

The API processor 110 may receive application programming interface (API) requests and communications from an application 104 of one of the computing devices 102 and may in part perform the data persistence process described below with reference to FIGS. 4-5. The health marketplace engine 112 may allow practitioners that have joined the healthcare social community to reach potential clients in ways unimaginable even a few years ago. In addition to giving practitioners a social portal with which to communicate and market themselves with consumers, the marketplace gives each healthcare practitioner the ability to offer their services in an environment that is familiar to users of Groupon, Living Social, or other social marketplaces.

In the system 100, each of the entries for a user may be stored in the store 114 in the health graph with a user's globally unique identifier and the identifiers of other users of the healthcare system that are allowed to view the information to provide, among other things, access control for the data in the healthstream of each user. In the system, each entry may default to being private among the users explicitly specified on the data. Users can choose to change the privacy level to ‘anonymous’ when they want to share information they've learned about a particular health issue with the community without revealing their identity.

FIG. 2 illustrates an example of an implementation of a data persistence system that may be implemented in the backend 108 and/or the store 114 depending on the particular implementation of the data persistence system. In the data persistence system, a piece of dynamic and flexible application level data enters the system through an API call from an application 104 being executed by one of the computing devices 102 as shown in FIG. 2. An example of the API call and the application level data that is contained in the API call is shown in FIG. 5. As shown in FIG. 2, the piece of data may be various types of data in various formats, such as save_entity data, X12 health data, HL7 data (described in more detail at www.hl7.org/implement/standards which is incorporated herein by reference), FHIR data (described in more detail at https://en.wikipedia.org/wiki/Fast_Healthcare_Interoperability_Resources which is incorporated herein by reference), FOAF data (described in more detail at www.foaf-project.org/ which is incorporated herein by reference) and/or JSON data. The API call contains metadata about the request. For example, the metadata could be represented within the “application_data” key field and corresponding JSON payload shown in FIG. 5. The disclosure is not limited to the example in FIG. 5 and the system may utilize other types of metadata that the example shown in FIG. 5.

The data persistence system persists the user specific application data and API metadata in the graph database 114 as shown. The data persistence system correlates each persisted transaction to a unique identifier which tracks the provenance of the dynamic data fields. For example, FIG. 5 contains an example payload to be used in the correlation process. In the process, various correlation techniques may be used. For example, the correlation technique may include strong identifier linkage, partial or full document hashing, streaming weighted clustering, . . . etc.

The data persistence system utilizes this identifier to dynamically correlate the inflow of application data and metadata as shown in FIG. 2 to evolve the entities contained within the database (shown as a graph database for in the example in FIG. 2). An example of the evolving correlation process is shown in FIG. 6 and described below. FIG. 2 details the flow of this procedure from application level to data persistence and concludes with entity correlation. FIG. 3 illustrates an example of a health graph database that may be interfaced by the data persistence system and may be modified/updated by the data persistence system.

FIG. 4 details an example of a method 400 for data persistence using the system having the three portions of the architecture shown in FIGS. 1-2. As shown, the method may be performed/implemented with the application 104 executed on a computing system, the backend system 108 and the healthgraph 114. Initially, an application 104 initializes the system through an API request (402—init( )). An example of this API request is described above. In the API request, the user declares application specific data (application level data) in the API call which is processed by the system 108 in FIGS. 1 and 2. The system 108 processes the API call and transforms it for the intended internal recipient (404—process_request( )). When an API call is received at the backend 108, it is transformed into an internal data model and run against an ensemble of correlation metrics that are executed by the streaming API processor 110. The data from the API may then be further combined with known data models from the health marketplace 112 and the input data stream is connected to the evolving entity (as described below) which is stored in the evolving entity persistence database 114.

The system 108 may then appends the user specific data fields and API metadata fields to the internal transfer (406—insert_application_data( )). An example of the appending of the data fields to the internal transfer and extraction of the identity models from a data stream and correlation to a known identity is shown in FIGS. 7A and 7B where FIG. 7A shows an example of the known identity and the data fields from the API call and FIG. 7B shows the correlation of the data fields in the API call to the known entity.

In the method, the backend system 108 may persist this data in an internal database (408—save_transaction( )) and a unique identifier that tracks the provenance of the dynamic data fields. The unique identifier created for each observed entity can be vector of length n which represents the centroid of the entity's correlated data payloads. As additional payloads are processed and linked, this unique identifier, which can be represented as a centroid vector, is adjusted to account for the appended data. The chaining and linking process is shown in FIG. 6 and the features observed and correlated in FIGS. 7A and 7B represent the evolution of the vector space and the calculable entity centroid for dynamic provenance tracking.

During this process, the fields of the data may be checked against a whitelist for permitted external data fields and persisted according to each key's level of accessibility. The whitelist is autonomously updated to learn, identify and pass user specific fields. Once the database has received and persisted the immutable transaction based on the application level data and the metadata, a process is initialized which will update internal data models via the transactions unique identifier as shown for example in FIGS. 7A and 7B. Furthermore, as additional streaming entities are processed, they are linked to the existing entities which are calculated to have the highest match. This process extracts the application specific data and API metadata and correlates the data from the single request to the entities which it contains (412—correlate_application_data( )). The internal data models are updated and evolved according to the newly observed application data and metadata (414—update_entities( )). In the process, once the data has been saved into the health graph 114, the system 108 may notify the application 104 that the process has been completed (410—notifying( )).

FIG. 5 illustrates a level to which application level users can provide custom fields for persistence in the graph database. In particular, FIG. 5 shows that the system allows for dynamic key value declaration at the API endpoint in the application_data field. The API interface allows for custom key value pairs in this field which, when combined with the metadata from the transaction, are persisted in the database. For example, the transaction's activity_id is used to correlate the dynamic key/values pairs across the transaction database that is part of the health graph database 114. This data is correlated and joined to update the identity of each entity present within the transaction.

FIG. 6 illustrates the evolving correlation process. As payloads 602 are processed in a streaming manner, they are linked with existing and known strong identifiers 604. Simultaneously, these streaming payloads are measured for correlation according to the observed meta data and optimized correlation algorithm as described above. An example payload with strong identifers and meta data for correlation is shown in FIG. 5.

The foregoing description, for purpose of explanation, has been described with reference to specific embodiments. However, the illustrative discussions above are not intended to be exhaustive or to limit the disclosure to the precise forms disclosed. Many modifications and variations are possible in view of the above teachings. The embodiments were chosen and described in order to best explain the principles of the disclosure and its practical applications, to thereby enable others skilled in the art to best utilize the disclosure and various embodiments with various modifications as are suited to the particular use contemplated.

The system and method disclosed herein may be implemented via one or more components, systems, servers, appliances, other subcomponents, or distributed between such elements. When implemented as a system, such systems may include an/or involve, inter alia, components such as software modules, general-purpose CPU, RAM, etc. found in general-purpose computers. In implementations where the innovations reside on a server, such a server may include or involve components such as CPU, RAM, etc., such as those found in general-purpose computers.

Additionally, the system and method herein may be achieved via implementations with disparate or entirely different software, hardware and/or firmware components, beyond that set forth above. With regard to such other components (e.g., software, processing components, etc.) and/or computer-readable media associated with or embodying the present inventions, for example, aspects of the innovations herein may be implemented consistent with numerous general purpose or special purpose computing systems or configurations. Various exemplary computing systems, environments, and/or configurations that may be suitable for use with the innovations herein may include, but are not limited to: software or other components within or embodied on personal computers, servers or server computing devices such as routing/connectivity components, hand-held or laptop devices, multiprocessor systems, microprocessor-based systems, set top boxes, consumer electronic devices, network PCs, other existing computer platforms, distributed computing environments that include one or more of the above systems or devices, etc.

In some instances, aspects of the system and method may be achieved via or performed by logic and/or logic instructions including program modules, executed in association with such components or circuitry, for example. In general, program modules may include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular instructions herein. The inventions may also be practiced in the context of distributed software, computer, or circuit settings where circuitry is connected via communication buses, circuitry or links. In distributed settings, control/instructions may occur from both local and remote computer storage media including memory storage devices.

The software, circuitry and components herein may also include and/or utilize one or more type of computer readable media. Computer readable media can be any available media that is resident on, associable with, or can be accessed by such circuits and/or computing components. By way of example, and not limitation, computer readable media may comprise computer storage media and communication media. Computer storage media includes volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data. Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and can accessed by computing component. Communication media may comprise computer readable instructions, data structures, program modules and/or other components. Further, communication media may include wired media such as a wired network or direct-wired connection, however no media of any such type herein includes transitory media. Combinations of the any of the above are also included within the scope of computer readable media.

In the present description, the terms component, module, device, etc. may refer to any type of logical or functional software elements, circuits, blocks and/or processes that may be implemented in a variety of ways. For example, the functions of various circuits and/or blocks can be combined with one another into any other number of modules. Each module may even be implemented as a software program stored on a tangible memory (e.g., random access memory, read only memory, CD-ROM memory, hard disk drive, etc.) to be read by a central processing unit to implement the functions of the innovations herein. Or, the modules can comprise programming instructions transmitted to a general purpose computer or to processing/graphics hardware via a transmission carrier wave. Also, the modules can be implemented as hardware logic circuitry implementing the functions encompassed by the innovations herein. Finally, the modules can be implemented using special purpose instructions (SIMD instructions), field programmable logic arrays or any mix thereof which provides the desired level performance and cost.

As disclosed herein, features consistent with the disclosure may be implemented via computer-hardware, software and/or firmware. For example, the systems and methods disclosed herein may be embodied in various forms including, for example, a data processor, such as a computer that also includes a database, digital electronic circuitry, firmware, software, or in combinations of them. Further, while some of the disclosed implementations describe specific hardware components, systems and methods consistent with the innovations herein may be implemented with any combination of hardware, software and/or firmware. Moreover, the above-noted features and other aspects and principles of the innovations herein may be implemented in various environments. Such environments and related applications may be specially constructed for performing the various routines, processes and/or operations according to the invention or they may include a general-purpose computer or computing platform selectively activated or reconfigured by code to provide the necessary functionality. The processes disclosed herein are not inherently related to any particular computer, network, architecture, environment, or other apparatus, and may be implemented by a suitable combination of hardware, software, and/or firmware. For example, various general-purpose machines may be used with programs written in accordance with teachings of the invention, or it may be more convenient to construct a specialized apparatus or system to perform the required methods and techniques.

Aspects of the method and system described herein, such as the logic, may also be implemented as functionality programmed into any of a variety of circuitry, including programmable logic devices (“PLDs”), such as field programmable gate arrays (“FPGAs”), programmable array logic (“PAL”) devices, electrically programmable logic and memory devices and standard cell-based devices, as well as application specific integrated circuits. Some other possibilities for implementing aspects include: memory devices, microcontrollers with memory (such as EEPROM), embedded microprocessors, firmware, software, etc. Furthermore, aspects may be embodied in microprocessors having software-based circuit emulation, discrete logic (sequential and combinatorial), custom devices, fuzzy (neural) logic, quantum devices, and hybrids of any of the above device types. The underlying device technologies may be provided in a variety of component types, e.g., metal-oxide semiconductor field-effect transistor (“MOSFET”) technologies like complementary metal-oxide semiconductor (“CMOS”), bipolar technologies like emitter-coupled logic (“ECL”), polymer technologies (e.g., silicon-conjugated polymer and metal-conjugated polymer-metal structures), mixed analog and digital, and so on.

It should also be noted that the various logic and/or functions disclosed herein may be enabled using any number of combinations of hardware, firmware, and/or as data and/or instructions embodied in various machine-readable or computer-readable media, in terms of their behavioral, register transfer, logic component, and/or other characteristics. Computer-readable media in which such formatted data and/or instructions may be embodied include, but are not limited to, non-volatile storage media in various forms (e.g., optical, magnetic or semiconductor storage media) though again does not include transitory media. Unless the context clearly requires otherwise, throughout the description, the words “comprise,” “comprising,” and the like are to be construed in an inclusive sense as opposed to an exclusive or exhaustive sense; that is to say, in a sense of “including, but not limited to.” Words using the singular or plural number also include the plural or singular number respectively. Additionally, the words “herein,” “hereunder,” “above,” “below,” and words of similar import refer to this application as a whole and not to any particular portions of this application. When the word “or” is used in reference to a list of two or more items, that word covers all of the following interpretations of the word: any of the items in the list, all of the items in the list and any combination of the items in the list.

Although certain presently preferred implementations of the invention have been specifically described herein, it will be apparent to those skilled in the art to which the invention pertains that variations and modifications of the various implementations shown and described herein may be made without departing from the spirit and scope of the invention. Accordingly, it is intended that the invention be limited only to the extent required by the applicable rules of law.

While the foregoing has been with reference to a particular embodiment of the disclosure, it will be appreciated by those skilled in the art that changes in this embodiment may be made without departing from the principles and spirit of the disclosure, the scope of which is defined by the appended claims. 

The invention claimed is:
 1. A system, comprising: a graph database that stores data about one or more entities associated with a healthcare computer system, the graph database having one or more nodes and one or more edges connecting each node and each node storing data about an entity and having a unique identifier of the entity; the healthcare computer system having a processor, memory and a plurality of instructions that is configured to: receive an application programming interface call containing application level data for a request for the entity and metadata about the request; a data persistence computer system, coupled to the healthcare computer system, having a processor, memory and a plurality of instructions that is configured to: persist the application level data and the metadata for the entity from the application programming interface call into the graph database and correlate, using the unique identifier of entity, each piece of persisted application level data and metadata to the entity already stored in the graph database; and wherein the entity already stored in the graph database is updated to include the correlated persisted application level data and metadata.
 2. The system of claim 1, wherein the healthcare computer system receives data about a health care transaction through the API call.
 3. The system of claim 2, wherein the data about a health care transaction is one of a X12 data format, a HL7 data format, a FHIR data format, a FOAF data format and a JSON data format.
 4. The system of claim 1, wherein the data persistence computer system correlates each piece of persisted application level data to a unique identifier in the graph database to correlate the persisted application level data.
 5. The system of claim 4, wherein the data persistence computer system tracks a provenance of the application level data using the unique identifier.
 6. A data persistence method, comprising: providing a graph database that stores data about one or more entities associated with a healthcare computer system, the graph database having one or more nodes and one or more edges connecting each node and each node storing data about an entity and having a unique identifier of the entity; receiving, at a healthcare computer system, application level data for a request for the entity through an application programming interface call, the application programming interface call having metadata about the request; persisting, in the graph database that is part of the healthcare computer system, the application level data and the metadata for the entity from the application programming interface call; correlating, in a data persistence computer system coupled to the healthcare computer system, each piece of persisted application level data and metadata using the unique identifier of entity to the entity already stored in the graph database; and updating, by the healthcare computer system, the entity already stored in the graph database to include the correlated persisted application level data and metadata.
 7. The method of claim 6, wherein receiving the application level data further comprises receiving data about a health care transaction.
 8. The method of claim 7, wherein the data about a health care transaction is one of a X12 data format, a HL7 data format, a FHIR data format, a FOAF data format and a JSON data format.
 9. The method of claim 6, wherein correlating each piece of persisted application level data and metadata further comprises correlating each piece of persisted application level data to a unique identifier in the graph database to correlate the persisted application level data.
 10. The method of claim 9, wherein corrected each piece of persisted application level data to a unique identifier further comprises tracking a provenance of the application level data using the unique identifier. 